Add feature of hipBLASLt online tuning #815
base: main
Signed-off-by: Fardin Hoque <[email protected]> Signed-off-by: Wentao Ye <[email protected]> Co-authored-by: Wentao Ye <[email protected]>
…e and Unsupported type' (vllm-project#26532) Signed-off-by: yewentao256 <[email protected]>
…deprecate use_inductor in favor of backend, simplify custom_ops (vllm-project#26502) Signed-off-by: morrison-turnansky <[email protected]> Signed-off-by: Morrison Turnansky <[email protected]> Signed-off-by: Luka Govedič <[email protected]> Co-authored-by: Luka Govedič <[email protected]> Co-authored-by: Jiangyun Zhu <[email protected]>
Signed-off-by: Fardin Hoque <[email protected]>
…lm-project#26728) Signed-off-by: Jialin Ouyang <[email protected]>
…dels (vllm-project#26526) Signed-off-by: Max de Bayser <[email protected]> Co-authored-by: Ayush Singh <[email protected]>
…24354) Signed-off-by: Lu Fang <[email protected]>
…t#26732) Signed-off-by: mgoin <[email protected]>
…cifying compile sizes (vllm-project#26681) Signed-off-by: angelayi <[email protected]>
…NSE (vllm-project#26742) Signed-off-by: Jialin Ouyang <[email protected]>
Signed-off-by: youkaichao <[email protected]>
Signed-off-by: Varun Sundar Rabindranath <[email protected]> Co-authored-by: Varun Sundar Rabindranath <[email protected]>
…-project#24024) Signed-off-by: n1ck-guo <[email protected]> Signed-off-by: Heng Guo <[email protected]> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
* add aiter rmsnorm and quant fusion kernel Signed-off-by: kliuae <[email protected]> * deprint Signed-off-by: kliuae <[email protected]> * disable aiter quant mm for compat Signed-off-by: kliuae <[email protected]> --------- Signed-off-by: kliuae <[email protected]> Co-authored-by: kliuae-amd <[email protected]>
…t#26602) Signed-off-by: Max de Bayser <[email protected]>
Signed-off-by: windsonsea <[email protected]>
…m-project#26723) Signed-off-by: Xiongfei Wei <[email protected]>
Co-authored-by: perzhang <[email protected]>
Signed-off-by: vllmellm <[email protected]>
…d. (alternative PR) (vllm-project#26718) Signed-off-by: Alexei V. Ivanov <[email protected]>
…#26758) Signed-off-by: Ryan Li <[email protected]>
…-project#26684) Signed-off-by: wangyafeng <[email protected]>
…26750) Signed-off-by: Ye (Charlotte) Qi <[email protected]>
… ` (vllm-project#20983) Signed-off-by: Max Wittig <[email protected]> Signed-off-by: Antoine Auger <[email protected]> Co-authored-by: Antoine Auger <[email protected]>
Signed-off-by: wangxiyuan <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
…6776) Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
…llm-project#26556) Signed-off-by: Chendi Xue <[email protected]>
…#26792) Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: ganyi <[email protected]>
Signed-off-by: ganyi <[email protected]>
Signed-off-by: ganyi <[email protected]>
Signed-off-by: ganyi <[email protected]>
[perf](moe): set fused_shared_expert to false
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
[Sync] Sync upstream and upgrade to version `0.11.1rc2.dev+ge9fce7b`
Ganyi/dsv3.2 rocm support
[Qwen] Add qwen3next script
* add aiter ops Signed-off-by: tjtanaavllm <[email protected]> * fix-format Signed-off-by: tjtanaavllm <[email protected]> * support fusion for aiter quant ops Signed-off-by: tjtanaavllm <[email protected]> --------- Signed-off-by: tjtanaavllm <[email protected]>
Update the deepseek readme
… table (#745) Signed-off-by: ganyi <[email protected]>
* update the condition to use bpreshuffle ptpc gemm Signed-off-by: tjtanaavllm <[email protected]> * remove print statement Signed-off-by: tjtanaavllm <[email protected]> * fix precommit Signed-off-by: tjtanaavllm <[email protected]> --------- Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: tjtanaavllm <[email protected]>
Signed-off-by: kliuae <[email protected]>
Purpose
Add support for hipBLASLt online tuning.
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md and examples for a new model.